A simulation study comparing likelihood and non-likelihood approaches in analyzing overdispersed count data

Authors

  • Gary Grunwald
  • Richard Jones
Abstract

Overdispersed count data can be modelled with likelihood and non-likelihood approaches. The likelihood approaches considered here are Poisson mixtures with three mixing distributions: the gamma, the lognormal, and the inverse Gaussian. The non-likelihood approaches are the robust sandwich estimator and quasilikelihood. In this simulation study, overdispersed count data were simulated under Poisson mixtures with the gamma, the lognormal, and the inverse Gaussian distributions, then analyzed with all five likelihood and non-likelihood approaches. Our results indicated that 1) when the count data are mildly overdispersed, there are virtually no differences among the five methods in type I error rate, standard error of the main effect, or empirical power; 2) when the count data are very overdispersed, none of the five approaches is robust to model misspecification as evaluated by type I error rate, standard error of the main effect, and empirical power. This simulation study cautions against using non-likelihood methods for analyzing very overdispersed count data, because of likely higher type I error rates and inappropriate power levels. Unlike non-likelihood approaches, likelihood approaches allow for statistical tests based on likelihood ratios and for checking model fit, and they provide a basis for power and sample size calculations. When likelihood approaches are used, we suggest comparing likelihood values to select the appropriate parametric method for analyzing very overdispersed count data. AMS 2000 subject classifications: Primary 60K35, 60K35; secondary 60K35.
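The data-generating step described in the abstract can be illustrated with a minimal sketch. It is not the authors' simulation code: the function names, the mean `mu`, the mixing variance `phi`, and the sample size are all assumptions chosen for illustration. Each count is drawn as Poisson with mean `mu * Z`, where `Z` has mean 1 and variance `phi` under one of the three mixing distributions, so the counts have mean `mu` and variance `mu + phi * mu**2`. The Pearson dispersion of an intercept-only Poisson fit (the quantity quasilikelihood uses to rescale standard errors) then estimates roughly `1 + phi * mu`.

```python
import math
import random
import statistics


def draw_mixing(rng, family, phi):
    """Draw a positive multiplier Z with E[Z] = 1 and Var[Z] = phi."""
    if family == "gamma":
        return rng.gammavariate(1.0 / phi, phi)  # (shape, scale)
    if family == "lognormal":
        sigma2 = math.log1p(phi)
        return rng.lognormvariate(-0.5 * sigma2, math.sqrt(sigma2))
    if family == "inverse_gaussian":
        # Michael-Schucany-Haas transformation with mean 1, shape 1/phi.
        shape = 1.0 / phi
        y = rng.gauss(0.0, 1.0) ** 2
        x = 1.0 + y / (2.0 * shape) - math.sqrt(4.0 * shape * y + y * y) / (2.0 * shape)
        return x if rng.random() <= 1.0 / (1.0 + x) else 1.0 / x
    raise ValueError(f"unknown mixing family: {family}")


def draw_poisson(rng, lam):
    """Knuth's multiplication method; adequate for the modest means used here."""
    threshold = math.exp(-lam)
    k, p = 0, rng.random()
    while p > threshold:
        k += 1
        p *= rng.random()
    return k


def simulate_counts(rng, family, mu, phi, n):
    """Simulate n overdispersed counts from a Poisson mixture (illustrative only)."""
    return [draw_poisson(rng, mu * draw_mixing(rng, family, phi)) for _ in range(n)]


def pearson_dispersion(counts):
    """Pearson chi-square / df for an intercept-only Poisson model.

    The fitted mean is the sample mean, so this is the sample variance
    divided by the sample mean; values well above 1 signal overdispersion.
    """
    mean = statistics.fmean(counts)
    return sum((y - mean) ** 2 / mean for y in counts) / (len(counts) - 1)


if __name__ == "__main__":
    rng = random.Random(2008)  # arbitrary seed, for reproducibility only
    for family in ("gamma", "lognormal", "inverse_gaussian"):
        counts = simulate_counts(rng, family, mu=4.0, phi=0.5, n=5000)
        print(family, round(statistics.fmean(counts), 2),
              round(pearson_dispersion(counts), 2))
```

All three mixtures share the same first two moments in this sketch, which is one reason a dispersion estimate alone cannot tell them apart; the abstract's suggestion to compare likelihood values addresses exactly that choice.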


Similar resources

Beta-Binomial and Ordinal Joint Model with Random Effects for Analyzing Mixed Longitudinal Responses

The analysis of discrete mixed responses is an important statistical issue in various sciences. Ordinal and overdispersed binomial variables are discrete. Overdispersed binomial data are a sum of correlated Bernoulli experiments with equal success probabilities. In this paper, a joint model with random effects is proposed for analyzing mixed overdispersed binomial and ordinal longitudinal respo...


The Development of Maximum Likelihood Estimation Approaches for Adaptive Estimation of Free Speed and Critical Density in Vehicle Freeways

The performance of many traffic control strategies depends on how accurately the traffic flow models have been calibrated. One of the most widely applied traffic flow models in traffic control and management is the LWR or METANET model. In practice, key parameters of the LWR model, including free flow speed and critical density, are parameterized using flow and speed measurements gathered by inductive ...


Modified Maximum Likelihood Estimation in First-Order Autoregressive Moving Average Models with some Non-Normal Residuals

When modeling time series data using autoregressive-moving average processes, it is common practice to presume that the residuals are normally distributed. However, sometimes we encounter non-normal residuals and asymmetry in the marginal distribution of the data. Despite widespread use of pure autoregressive processes for modeling non-normal time series, the autoregressive-moving average models have le...


Modified signed log-likelihood test for the coefficient of variation of an inverse Gaussian population

In this paper, we consider the problem of two-sided hypothesis testing for the coefficient of variation of an inverse Gaussian population. The approach used here is the modified signed log-likelihood ratio (MSLR) method, which is a modification of the traditional signed log-likelihood ratio test. Previous works show that this proposed method has third-order accuracy whereas the traditi...


On the EM algorithm for overdispersed count data.

In this paper, we consider the use of the EM algorithm for the fitting of distributions by maximum likelihood to overdispersed count data. In the course of this, we also provide a review of various approaches that have been proposed for the analysis of such data. As the Poisson and binomial regression models, which are often adopted in the first instance for these analyses, are particular examp...



Publication date: 2008